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(57) Abstract 



The present invention involves demodulating a signal in a spread spectrum multiple access system employing a pilot on the toward 
link. The rake receiver (10) separates the signal processing based on the period over which the processing occurs. Symbol rate processing 
is performed by a single time-shared mutiply-accumulate datapath (34) that services multiple finger front ends (312) and a searcher front 
end (314). The front ends (312) are dedicated circuits that perform all chip rate processing, producing a data vector and asserting a flag 
indicating the results are ready to be serviced by the shared datapath. A datapath controller (308) arbitrates use of the datapath between 
the finger front ends (312), the searcher front end (314), and combining functions, configuring the datapath to service them on a first-come, 
first-serve basis. The controller (308) sequences the datapath through a fixed routine as dictated by the signal processing associated with 
the block being serviced. 
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RAKE RECEIVER ARCHITECTURE FOR A MOBILE DEMODULATOR USED IN A 
CDMA COMMUNICATION SYSTEM 



5 

BACKGROUND OF THE INVENTION 

I. Field of the Invention 

10 This invention relates generally to spread spectrum communication 

systems and, more particularly, to signal processing in a cellular telephone 
communication system. 

II. Description of the Related Art 

15 

In a wireless radiotelephone communication system, many users 
communicate over a wireless channel to connect to wireline telephone 
systems. Communication over the wireless channel can be one of a variety of 
multiple access techniques that allow a large number of users in a limited 

20 frequency spectrum. These multiple access techniques include time division 
multiple access (TDMA), frequency division multiple access (FDMA), and 
code division multiple access (CDMA). 

The CDMA technique has many advantages. An exemplary CDMA 
system is described in U.S. Patent No. 4,901,307, entitled Spread Spectrum 

25 Multiple Access Communication System Using Satellite or Terrestrial 
Repeaters, issued February 13, 1990 and assigned to the assignee of the present 
invention. 

In the '307 patent, a multiple access technique is disclosed where a large 
number of mobile telephone system users, each having a transceiver, 
30 communicate through satellite repeaters or terrestrial base stations using 
CDMA spread spectrum communication signals. The base station-to-mobile 
station signal transmission path is referred to as the forward link and the 
mobile station-to-base station signal transmission path is referred to as the 
reverse link. 

35 In using CDMA communications, the frequency spectrum can be 

reused multiple times thus permitting an increase in system user capacity. 
Each base station provides coverage to a limited geographic area and links the 
mobile stations in its coverage area through a cellular system switch to the 
public switched telephone network (PSTN). When a mobile station moves to 

40 the coverage area of a new base station, the routing of that user's call is 
transferred to the new base station. 
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The CDMA modulation techniques discussed in the 307 patent and in 
U.S. Patent No. 5,102,459 entitled System and Method For Generating Signal 
Waveforms in a CDMA Cellular Telephone System, issued June 25, 1990 and 
assigned to the assignee of the present invention, mitigate the special 
5 problems of the terrestrial channel, such as multipath and fading. Instead of 
being a drawback, as it is with narrowband systems, separable multipath can be 
diversity combined in a mobile rake receiver for enhanced modem 
performance. In the mobile radio channel, multipath is created by reflection of 
the signal from obstacles in the environment, such as buildings, trees, cars, 

10 and people. In general the mobile radio channel is a time varying multipath 
channel due to the relative motion of the structures that create the multipath. 
For example, if an ideal impulse is transmitted over the time varying 
multipath channel, the received stream of pulses would change in time 
location, attenuation, and phase as a function of the time that the ideal 

15 impulse was transmitted. 

The multipath properties of the terrestrial channel produce, at the 



T-oroiUDr cirmalq hQirinrr frairalan cottqi'') i Hie ^. - - 



characteristic of a multipath channel is the time spread introduced in a signal 
that is transmitted through the channel The spread spectrum pseudonoise 

20 (PN) modulation used in a CDMA system allows different propagation paths 
of the same signal to be distinguished and combined, provided the difference 
in path delays exceeds the PN chip duration. If a PN chip rate of approximately 
I MHz is used in a CDMA system, the full spread spectrum processing gain/ 
equal to the ratio of the spread bandwidth to the system data rate, can be 

25 employed against paths having delays that differ by more than one 
microsecond. A one microsecond path delay differential corresponds to a 
differential path distance of approximately 300 meters. The urban 
environment typically provides differential path delays in excess of one 
microsecond 

30 Another characteristic of the multipath channel is that each path 

through the channel may cause a different attenuation factor. For example, if 
an ideal impulse is transmitted over a multipath channel, each pulse of the 
received stream of pulses generally has a different signal strength than other 
received pulses. 

35 Yet another characteristic of the multipath channel is that each path 

through the channel may cause a different phase on the signal. If, for example, 
an ideal impulse is transmitted over a multipath channel, each pulse of the 
received stream of pulses generally has a different phase than other received 
pulses. This can result in signal fading. 
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A fade occurs when multipath vectors are added destructively, yielding 
a received signal that is smaller than either individual vector. For example, if 
a sine wave is transmitted through a multipath channel having two paths 
where the first path has an attenuation factor of X dB, a time delay of d with a 
5 phase shift of Q radians, and the second path has an attenuation factor of X dB, 
a time delay of d with a phase shift of Q + p radians, no signal would be 
received at the output of the channel. 

As described above, the PN chip interval defines the minimum 
separation two paths must have in order to be combined. Before the distinct 

10 paths can be demodulated, the relative arrival times (or offsets) of the paths in 
the received signal must first be determined. The demodulator performs this 
function by "searching" through a sequence of offsets and measuring the 
energy received at each offset. If the energy associated with a potential offset 
exceeds a certain threshold, a demodulation element, or "finger" may be 

15 assigned to that offset. The signal present at that path offset can then be 
summed with the contributions of other fingers at their respective offsets. 

A method and apparatus of finger assignment based on searcher and 
finger energy levels is disclosed in co-pending U.S. Patent Application. This 
application is Serial No. 08/144,902 entitled Finger Assignment in a System 

20 Capable of Receiving Multiple Signals, filed October 28, 1993 and assigned to 
the assignee of the present invention. 

FIG. 1 shows an exemplary set of signals from a base station arriving at 
the mobile station. The vertical axis represents the power received on a 
decibel (dB) scale. The horizontal axis represents the delay in the arrival time 

25 of a signal due to multipath delays. The axis (not shown) going into the page 
represents a segment of time. Each signal spike in the common plane of the 
page has arrived at a common time but was transmitted by the base station at a 
different time. 

In a common plane, peaks to the right were transmitted at an earlier 
30 time by the base station than peaks to the left. For example, the left-most peak 
spike 2 corresponds to the most recently transmitted signal. Each signal spike 2 
- 7 has traveled a different path and therefore exhibits a different time delay 
and a different amplitude response. 

The six different signal spikes represented by spikes 2-7 are 
35 representative of a severe multipath environment. Typical urban 
environments produce fewer usable paths. The noise floor of the system is 
represented by the peaks and dips having lower energy levels. 

The task of the searcher is to identify the delay as measured by the 
horizontal axis of signal spikes 2 - 7 for potential finger assignment. The task 
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of the finger is to demodulate one of a set of the multipath peaks for 
combination into a single output. It is also the task of a finger, once assigned 
to a multipath peak, to track that peak as it may move in time. 

The horizontal axis can also be thought of as having units of PN offset. 
5 At any given time, the mobile station receives a variety of signals from a base 
station, each of which has traveled a different path and may have a different 
delay than the others. The base station's signal is modulated by a PN sequence. 
A local copy of the PN sequence is also generated at the mobile station. Also at 
the mobile station, each multipath signal is individually demodulated with a 

10 PN sequence code aligned to its received time offset. The horizontal axis 
coordinates can be thought of as corresponding to the PN sequence code offset 
that would be used to demodulate a signal at that coordinate. 

Note that each of the multipath peaks varies in amplitude as a function 
of time, as shown by the uneven ridge of each multipath peak. In the limited 

15 time shown, there are no major changes in the multipath peaks. Over a more 
extended time range, multipath peaks disappear and new paths are created as 
time progresses. The peaks can also slide to earlier or later offsets as the path 
distances change when the mobile station moves around in the base station's 
area of coverage. Each finger tracks these small variations in the signal 

20 assigned to it. 

In narrowband systems, the existence of multipath in the radio channel 
can result in severe fading across the narrow frequency band being used. Such 
systems are capacity constrained by the extra transmit power needed to 
overcome a deep fade. As noted above, CDMA signal paths may be 

25 discriminated and diversity combined in the demodulation process. 

Three major types of diversity exist: time diversity, frequency diversity, 
and space/path diversity. Time diversity can best be obtained by the use of 
repetition, time interleaving, and error correction and detection coding that 

30 form of time diversity. 

CDMA, by its inherent wideband nature, offers a form of frequency 
diversity by spreading the signal energy over a wide bandwidth. The frequency 
selective fading that can cause a deep fade across a narrowband system's 
frequency bandwidth usually only affects a fraction of the frequency band 

35 employed by the CDMA spread spectrum signal. 

The rake receiver provides path diversity through its ability to combine 
multipath delayed signals; all paths that have a finger assigned to them must 
fade together before the combined signal is degraded. Additional path 
diversity is obtained through a process known as "soft hand-off" in which 
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5 

multiple simultaneous, redundant links from two or more base stations can 
be established with the mobile station. This supports a robust link in the 
challenging environment at the cell boundary region. Examples of path 
diversity are illustrated in U.S. Patent No. 5,101,501 entitled Soft Hand-Off in a 
CDMA Cellular Telephone System, issued March 21, 1992 and U.S. Patent No. 
5,109,390 entitled Diversity Receiver in a CDMA Cellular Telephone System, 
issued April 28, 1992, both assigned to the assignee of the present invention. 

Both the cross-correlation between different PN sequences and the 
autocorrelation of a PN sequence, for all time shifts other than zero, have a 
nearly zero average value. This allows the different user signals to be 
discriminated upon reception. Autocorrelation and cross-correlation require 
that logical "0" take on a value of "1" and logical "1" take on a value of "-1", or 
a similar mapping, in order that a zero average value be obtained. 

However, such PN signals are not orthogonal. Although the cross- 
15 correlation essentially averages to zero over the entire sequence length for a 
short time interval, such as an information bit time, the cross-correlation is a 
random variable with a binomial distribution. As such, the signals interfere 
with each other in much the same manner as if they were wide bandwidth 
Gaussian noise at the same power spectral density. 
20 It is well known in the art that a set of n orthogonal binary sequences, 

each of length n, for n any power of 2 can be constructed (see Digital 
Communications with Space Applications, S.W. Golomb et al, Prentice-Hall, 
Inc., 1964, pp. 45-64). In fact, orthogonal binary sequence sets are also known 
for most lengths that are multiples of four and less than two hundred. One 
25 class of such sequences that is easy to generate is called the Walsh function; a 
Walsh function of order n can be defined recursively as follows: 



W \72h nh 

w(n/ 2 \ W (n/ 2 



(1) 



30 where W denotes the logical complement of W, and W(1) = |0|. 

A Walsh sequence or code is one of the rows of a Walsh function 
matrix. A Walsh function matrix of order n contains n sequences, each of 
length n Walsh chips. A Walsh function matrix of order n (as well as other 
orthogonal functions of length n) has the property that over the interval of n 

35 bits, the cross-correlation between all the different sequences within the set is 
zero. Every sequence in the set differs from every other sequence in exactly 
half of its bits. It should also be noted that there is always one sequence 
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containing all zeroes and that all the other sequences contain half ones and 
half zeroes. 

In the system described in the '459 patent, the call signal begins as a 9600 
bit per second information source which is then converted by a rate 1/2 
5 forward error correction encoder to a 19,200 symbols per second output 
stream. Each call signal broadcast from a cell is covered with one of sixty-four 
orthogonal Walsh sequences, each sixty-four Walsh chips, or one symbol, in 
duration. Regardless of the symbol being covered, the orthogonality of all 
Walsh sequences ensures that all interference from other user signals in that 
10 cell are canceled out during symbol integration. The non-orthogonal 
interference from other cells limits capacity on the forward link. 

All user signals transmitted by a base station are quadrature phase shift 
key (QPSK) spread using the same in-phase (I) channel PN sequence and 
quadrature (Q) channel PN sequence. Each base station in a CDMA system 
15 transmits in the same frequency band using the same PN sequence, but with a 
unique offset relative to an unshifted PN sequence aligned to a universal time 

* w*._w. j. * , v,ca%*A*tg, iGic 10 vi iv. i>dix lc cio me rVaiDii i_uvd idle, 

1.2288 MHz, or 64 PN chips per symbol. In the preferred embodiment, each 
base station transmits a pilot reference. 

20 The pilot channel is a "beacon" transmitting a constant zero symbol 

and spread with the same I and Q PN sequences used by the traffic bearing 
signals. In the preferred embodiment, the pilot channel is covered with the all 
zero Walsh sequence 0. During initial system acquisition the mobile searches 
all possible shifts of the PN sequence and once it has found a base station's 

25 pilot, it can then synchronize itself to system time. As detailed below, the pilot 
plays a fundamental role in the mobile demodulator rake receiver 
architecture well beyond its use in initial synchronization. 

FIG. 2 depicts a radio's generic rake receiver demodulator 10 for 
receiving and demodulating the forward link signal 20 arriving at the 

30 antenna 18. The analog transmitter and receiver 16 contain a QPSK 
downconverter chain that outputs digitized I and Q channel samples 32 at 
baseband. The sampling clock, CHIPX8 40, used to digitize the receive 
waveform, is derived from a voltage controlled temperature compensated 
local oscillator (TCXO). 

35 The demodulator 10 is supervised by a microprocessor 30 through the 

databus 34. Within the demodulator, the I and Q samples 32 are provided to a 
plurality of fingers 12a-c and a searcher 14. The searcher 14 searches out 
windows of offsets likely to contain multipath signal peaks suitable for 
assignment of fingers 12a-c. For each offset in the search window, the searcher 
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14 reports the pilot energy it found at that offset to the microprocessor. The 
fingers 12a-c are then surveyed, and those unassigned or tracking weaker 
paths are assigned by the microprocessor 30 to offsets containing stronger 
paths identified bv searcher 14. 
5 Once a finger 12a-c has locked onto the multipath signal at its assigned 

offset it then tracks that path on its own until the path fades away or until it is 
reassigned using its internal time tracking loop. This finger time tracking loop 
measures energy on either side of the peak at the offset at which the finger is 
currently demodulating. The difference between these energies forms a metric 

10 which is then filtered and integrated. 

The output of the integrator controls a decimator that selects one of the 
input samples over a chip interval to use in demodulation. If a peak moves, 
the finger adjusts its decimator position to move with it. The decimated 
sample stream is then despread with the PN sequence consistent with the 

15 offset to which the finger is assigned. The despread I and Q samples are 
summed over a symbol to produce a pilot vector (Pi, Pq). These same 
despread I and Q samples are Walsh uncovered using the Walsh code 
assignment unique to the mobile user and the uncovered, despread I and Q 
samples are summed over a symbol to produce a symbol data vector (Di, Dq). 

20 The dot product operator is defined as 

P(n) • D(n) = PKnpKn) + P Q (n)D Q (n) (2) 

where Pi(n) and PQ(n) are respectively the I and Q components of the pilot 
25 vector P for symbol n and Di(n) and D Q (n) are respectively the I and Q 
components of the data vector D for symbol n. 

Since the pilot signal vector is much stronger than the data signal 
vector it can be used as an accurate phase reference for coherent 
demodulation; the dot product computes the magnitude of the data vector 
30 component in phase with the pilot vector. As described in co-pending U.S. 
Application No. 07/981,034 entitled Pilot Carrier Dot Product Circuit and 
assigned to the assignee of the present invention, the dot product weights the 
finger contributions for efficient combining, in effect scaling each finger 
symbol output 42a-c by the relative strength of the pilot being received by that 
35 finger. Thus the dot product performs the dual role of both phase projection 
and finger symbol weighting needed in a coherent rake receiver demodulator. 

Each finger has a lock detector circuit that masks the symbol output to 
the combiner 42 if its long term average energy does not exceed a minimum 
threshold. This ensures that only fingers tracking a reliable path will 
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contribute to the combined output, thus enhancing demodulator 
performance. 

Due to the relative difference in arrival times of the paths to which 
each finger 12a-c is assigned, each finger 12a-c has a deskew buffer that aligns 
5 the finger symbol streams 42a-c so that the symbol combiner 22 can sum them 
together to produce a "soft decision" demodulated symbol. This symbol is 
weighted by the confidence that it correctly identifies the originally 
transmitted symbol. The symbols are sent to a deinterleaver/decoder circuit 28 
that first frame deinterleaves and then forward error correction decodes the 

10 symbol stream using the maximum likelihood Viterbi algorithm. The 
decoded data is then made available to the microprocessor 30 or to other 
components, such as a speech vocoder, for further processing. 

On the reverse link, to maximize system capacity, it is important that 
all signals from mobiles are received at the cell at the same signal strength. A 

15 closed loop power control method, disclosed in U.S. Patent No. 5,056,109 
entitled Method and Apparatus for Controlling Transmission Power in a 
CDMA Cellular Mobile Telephone System, issued October 8, 1991 and assigned 
to the assignee of the present invention, is employed. 

The closed loop power control method operates by the cell measuring 

20 the signal being received by the mobile and sending a command to the mobile 
to increase or decrease its power level on a punctured sub-channel on the 
forward link. A power control symbol combiner 24 extracts these punctured 
^, ^..^v, w i^** i^iryuiw lAnrv c? till inmig Liic zyinuui uutputs iroin me ringers 
42a-c, and making a hard decision whether or not to adjust power up or down. 

25 These decisions are integrated to provide a transmit gain reference level 
output, TXGAIN 38, to the transmit power amplifier in the analog transmitter 
and receiver 16. 

To demodulate correctly, a mechanism is needed to align the local 
oscillator frequency with the clock used at the cell to modulate the data. Each 
30 finger makes an estimate of the frequency error by measuring the rotation rate 
of the pilot vector in QPSK I, Q space using the cross product vector operator: 

P{n) X P(n-1) = Pi(n)P Q (n-l) - Pi(n-i)P Q (n) (3) 

35 The frequency error estimates from each finger 44a-c are combined and 

integrated in frequency error combiner 26. The integrator output, LO_ADJ 36, 
is then fed to the voltage control of the TCXO in the analog transmitter and 
receiver 16 to adjust the clock frequency of the CHIPX8 clock 40, thus 
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providing a closed loop mechanism for compensating for the frequency error 
of the local oscillator. 

1^^' In a typical dedicated circuit implementation of the mobile rake 

receiver demodulator, each of the fingers, the searcher, and combiners are 
5 implemented separately as discrete circuits with each having a direct 
correspondence to some circuit area on an integrated circuit (IC) die. Each of 
these blocks are self-contained and responsible for their own signal processing 
tasks and, as such, the blocks in turn have their own separate accumulators, 
multipliers, and comparators. These dedicated circuits, especially the 

10 numerous multiply-accumulators needed for each finger, require a large 
amount of die area to implement. 

In a typical digital signal processor (DSP) implementation of the 
demodulator, each of the fingers, the searcher, and combiners are 
implemented as separately coded subroutines in a demodulator task. There 

15 are many simple operations that must be performed in the fingers and the 
searcher at the PN chip rate. A general DSP architecture may need to execute 
up to 75 million instructions per second (MIPS) to perform the chip rate 
processing for the three fingers and the searcher of FIG. 2 in a system using a 
PN chip rate of 1.2288 MHz as in the preferred embodiment. A 75 MIPS DSP 

20 consumes a considerable amount of power. Power is at a premium in the 
mobile unit, which is often a hand-held consumer device. One significant 
advantage of the DSP approach is the flexibility of implementing 
demodulation algorithm changes through a firmware change compared to 
making physical circuit changes as is the case with the traditional dedicated 

25 circuit approach. 

Both the dedicated circuit and the general DSP implementations have 
their respective die area and power concerns, which are not resolved even 
after taking advantage of the feature size shrink in the latest IC fabrication 
processes. There is a resulting need for a more efficient demodulator. 



30 



SUMMARY OF THE INVENTION 



The present invention encompasses a method and apparatus for 
35 demodulating a signal in a spread spectrum multiple access communication 
system employing a pilot on the forward link. The demodulator architecture 
embodied in this invention results in a smaller area chip consuming less 
power and costing less than either a dedicated circuit or a general DSP 
implementation equivalent. 
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The dedicated symbol rate circuits, a majority of the area required to 
implement a finger or searcher using the traditional dedicated circuit 
approach, are removed from the fingers and searcher, their symbol rate 
functionality having been incorporated into the shared datapath processing. 
5 What remains, termed a finger front end or searcher front end to distinguish 
from the original finger and searcher, are dedicated circuits performing all 
chip rate processing associated with the finger or searcher respectively. 

The present invention separates the signal processing functions into 
two groups based on the period over which the processing occurs. Specifically, 
10 this new architecture uses a single time-shared multiply-accumulate (MAC) 
datapath that services a plurality of finger front ends and a searcher front end. 
The datapath performs ail symbol rate processing associated with the fingers 
and the searcher. 

This same datapath combines, at the symbol rate, the outputs of the 

15 fingers. This produces a demodulated symbol stream and a power control sub- 
channel decision stream that are used to control the transmit power on the 
reverse link, and a frequency error estimate used to adjust the local oscillator. 
In conjunction with the datapath, a small register file RAM stores all state 
information for signal processing operations that span more than a symbol. 

20 Once per symbol for the fingers or once per integration period for the 

searcher, the front ends produce a data vector consisting of the I and Q symbol 
integration results for the pilot, the traffic channel symbol data, and the pilot 
oemodulated half a chip offset from the current finger offset for time tracking, 
or in the case of the searcher, the I and Q integration for the pilot for each of 

25 the offsets being evaluated simultaneously. These outputs are buffered so that 
the datapath may have access to the accumulated data vector during one 
symbol while the front end accumulators are summing the data vector for the 
next symbol. Along with the data vector, the front ends assert a flag indicating 
they have produced results that need to be serviced by the shared datapath. 

30 A datapath control circuit arbitrates the use of the datapath between the 

finger front ends, the searcher front end, and the combining functions, on a 
first-come, first-serve basis. Once queued, the controller sequences the 
datapath through a fixed routine, selecting components of the data vector 
being operated on along with the state information stored in the register file 

35 RAM. The controller then configures the datapath to perform all of the 
accumulation, multiplication, and comparisons associated with the signal 
processing of the block being serviced. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 represents an exemplary severe multipath signal condition. 
5 FIG, 2 is a prior art block diagram of a mobile demodulator rake 

receiver. 

FIG. 3 is a block diagram of the finger functionality. 
FIG. 4 is a block diagram of the searcher functionality. 
FIG. 5 is a block diagram of the combiner blocks functionality. 
10 FIG. 6 is a block diagram of the shared datapath architecture mobile 

demodulator according to the invention. 

FIG. 7 is a block diagram of the finger front end. 
FIG. 8 is a block diagram of the searcher front end. 

FIG. 9 is a memory map of state information associated with the symbol 
15 rate signal processing of the demodulator. 

FIG. 10 is a sequencing timeline of the shared datapath while it services 
a finger. 

FIG. 11 is a sequencing timeline of the shared datapath while it services 
the searcher. 

20 FIG. 12 is a sequencing timeline of the shared datapath while it services 

the combiners. 



DESCRIPTION OF THE PREFERRED EMBODIMENT 

25 

As previously discussed, FIG. 2 is a top level functional overview of a 
rake receiver demodulator 10. The analog front end 16 receives the forward 
link signal 20 via an antenna 18, downconverts it to baseband and outputs 
digitized I and Q channel samples 32 to a plurality of fingers 12a-c and a 

30 searcher 14. The searcher 14 searches out windows of offsets likely to contain 
multipath signal peaks suitable for assignment of the fingers 12a-c. For each 
offset in the search window, the searcher 14 reports the pilot energy it found at 
that offset to the microprocessor 30. The fingers 12a-c are then surveyed and 
those unassigned or tracking weaker paths are assigned by the microprocessor 

35 30 to offsets containing stronger paths identified by the searcher 14. 

All of the fingers 12a-c contain the same functionality depicted in the 
finger functional block diagram of FIG. 3. In a traditional dedicated circuit 
implementation, each of the elements in FIG. 3 have a correspondence to a 
physical circuit; in a traditional general purpose DSP implementation, each of 

40 these elements have corresponding steps in the signal processing code. In the 
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preferred embodiment, a clear delineation between the processing occurring at 
chip rate and that at symbol rate is made is indicated by chip-symbol 
processing boundary 98. All elements operating on the chip level are shown 
to the left of boundary 98, and all elements operating at the symbol rate are 

5 shown to the right of boundary 98. 

The I and Q samples 32 are input to a decimator 102 that, based on the 
finger's assigned offset, selects one of the eight samples per chip to use for on- 
time processing and another sample half a chip delayed for use in time 
tracking. This sampling, as well as all other chip rate processing in the finger, 

10 is slaved to the chip enable strobe 156 from a finger timing generator 122. The 
finger timing generator 122 tracks the time offset of the multipath peak being 
demodulated. 

Each advance or retard generated, either due to a time tracking loop 
adjustment or a slew command by the microprocessor to move to a new 

15 offset, has the effect of slowing down or speeding up the rate at which the chip 
enable strobe 156 is generated, and the related symbol enable strobe 158 that in 
the preferred embodiment asserts with every sixty-fourth chip enable strobe 
156. The finger timing generator reflects any offset changes by incrementing or 
decrementing an internal finger position register read by the microprocessor. 

20 The finger timing generator 122 also contains an internal position assignment 
register written by the microprocessor to slew the finger to a new offset during 
finger reassignment. When the microprocessor reassigns a finger, an internal 
mechanism in the finger timing generator 122 continues to advance or retard 
the timing until it determines that the finger has reached its assigned offset. 

25 The decimated on-time and late I and Q chip samples are provided to 

QPSK despreaders 104a and 104b respectively. The despreaders 104 also 
receive, from the I Q FN sequence generator 106, the PN sequences identical to 
those used to spread the data at the base station. The I Q PN sequence 
generator 106 is slaved to the chip enable output 156 from the finger timing 

30 generator 122, and is thus generating the PN sequences consistent with the 
assigned offset of the finger. Another way of describing this is that the 
sequences output from the PN generator 106 are delayed from their 
counterpart sequences in the base station by the multipath propagation delay 
from the base station to the mobile unit. Therefore, the despreading process in 

35 the demodulator can reverse the spreading process in the modulator with the 
correct time alignment. 

In order to recover the originally transmitted data, despread I and Q 
chips are respectively output from on-time despreader 104a to exclusive-or 
(Xxjk) gates iOo. ine vvaisn scuuculc genej.«io± *w ~ — — — 0 — - 
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108 the Walsh chip sequence corresponding to the Walsh code assigned to the 
mobile unit to reverse the orthogonal covering applied at the base station. 

The Walsh code is conveyed to the finger through microprocessor 
databus 34. The despread and uncovered I and Q chips are summed by I and Q 
5 symbol accumulators 110 and 112 over a symbol interval, once per symbol 
producing a symbol data pair Di(n) and Dq(n) for symbol n. Since the pilot 
channel is covered with the all zeroes Walsh code 0, no separate Walsh 
sequence generator is needed to recover the pilot. The output of on-time 
despreader is directly summed by on-time I and Q accumulators 114 and 116, 

10 producing a pilot pair P](n) and PQ(n) for symbol n. 

The time tracking loop is driven by the difference of pilot strengths 
offset half a chip from the current finger offset. Therefore, a separate set of I 
and Q accumulators 118 and 120 sums the despread pilot provided by late 
despreader 104b using samples delayed half a chip from those used by the on- 

15 time pilot and symbol accumulators. To produce a pilot pair half a chip 
delayed from the on-time pilot pair, Pil(h) and Pql(h) for symbol n, late 
despreader 104b uses the same PN sequence used by on-time despreader 104a. 
To produce a pilot pair half a chip advanced from the on-time pilot pair, 
PiE(n) and Pqe(h) for symbol n, late despreader 104b uses a PN sequence 

20 delayed a chip from that used by despreader 104a. The time tracking loop 
makes use of the half chip advanced and delayed pilot pairs on alternating 
symbols. With each symbol enable strobe 158, the accumulators 110, 112, 114, 
116, 118, 120 are cleared and begin summing over the next symbol interval. 
The elements described above perform all of the chip rate processing 

25 occurring in the finger, drawn in FIG. 3 to the left of boundary 98. The net 
result of this chip rate processing is a data vector produced once per symbol: 

{ D,(n), D Q (n), Pi(n), P Q (n), Pie/lW, P QE /L(n)} 

30 which is then processed at the symbol rate by elements drawn to the right of 

boundary 98 in FIG. 3. 

The symbol rate processing typically starts by filtering the on-time I and 

Q pilot data Pj(n), PQ(n), as shown by I and Q Pilot filters 132 and 134 in FIG. 3. 

This filtering smoothes the symbol to symbol variations in the pilot reference, 
35 providing a steadier reference for the dot product's phase projection and 

scaling operations. 

In the preferred embodiment, I and Q pilot filters 132 and 134 are 

configured as simple first order infinite impulse response (IIR) filters. For 

each symbol, a fraction of the current filter value is subtracted off and a new 
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input, pilot data Pi(n), PQ(n) is summed in to produce a new filter output 
Pfi(n),PfQ(n). 

Once per symbol, Dot Product circuit 130 performs the dot product 
operation defined in Equation (2), taking the Di(n), DQ(n) symbol vector and 
5 dotting it with the filtered pilot vector Pfi(n), Pfq(n). This results in a scalar 
value indicative of the magnitude of the data symbol in phase with the pilot, 
scaled by the strength of the pilot being received. 

After truncation and limiting (not shown) used to renormalize the dot 
product result to the bits of interest, this symbol output is written into a 
10 symbol deskew buffer 144. The deskew buffer is a first-in, first-out (FIFO) 
buffer written with the finger's own particular symbol alignment as provided 
by symbol enable strobe 158. The deskew buffers in all fingers are read using 
the same combiner symbol enable strobe (not shown). This compensates for 
the different offsets the fingers are assigned and allows the symbol combiner 
15 22 to sum the symbol streams from the different fingers together. 

The symbol output of the deskew buffer is masked by an AND gate 152 
when the finger is in an out of lock state. The lock state 148 is an indicator 
that the finger is tracking a reliable and reasonably strong path, and masking 
the finger symbol output when the finger is out of lock produces a higher 
20 quality combined symbol stream output from the combiner 22. 

The signal processing to determine the lock state begins with the Energy 
circuit 140 using the I and Q pilot filter outputs to determine [Pf'i(n) 2 + PfQ(n) 2 ], 
corresponding to the energy in the pilot for the peak being tracked. This 
energy is then filtered by the lock detect filter 142 to produce a long term 
25 average finger energy level. During finger reassignment, the microprocessor 
30 can read this finger energy back and compare it to the latest multipath 
peaks found by searcher 14, reassigning the finger to stronger paths found by 
the searcher as the multipath environment changes and peaks come and go. 

In the preferred embodiment, the lock detect filter 142 is configured as a 
30 simple first order IIR filter. For each symbol, a fraction of the finger energy 
maintained in the filter is subtracted off and the energy result output from 
energy circuit 140 is summed in to produce a new filtered finger energy 
output. 

The threshold compare block 150 compares the finger energy output 
35 from the lock detect filter 142 to an in-Iock threshold and an out-of-lock 
threshold written into the block by the microprocessor 30. If the finger energy 
is above the in-lock threshold, lock state 148 is forced to the in-lock state. If the 
energy is below the out-of-lock threshold, the lock state 148 is forced to the 
out-of-iock state. Otherwise the lock state 148 remains unchanged. This 
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produces a hysteresis effect on lock state 148 in that, once the finger falls out of 
lock, its energy must rise above in-lock threshold to go back into lock, and 
once the finger is locked, the energy must fall below the out-of-lock threshold 
to go out of lock. 

5 Once per symbol, the cross product circuit 146 performs the cross 

product operation defined in Equation (3), crossing the filtered pilot Pfi(n), 
Pf<2(n) vector with the filtered pilot vector for the previous symbol, Pfi(n-l), 
PfQ(n-l). This results in a scalar value indicative of the rotation rate of the 
pilot in QPSK I, Q space, providing a measure of the frequency error between 

10 the local oscillator clock and that used to transmit the signal at the base 
station. After truncation and limiting (not shown) used to renormalize the 
cross product result to the bits of interest, this frequency error is masked by 
AND gate 154 when the finger is in an out of lock state, so that the finger only 
contributes to LO_ADJ signal 36 when it is tracking a reliable and reasonably 

15 strong path. 

As previously described, a time tracking loop keeps the finger centered 
on its assigned multipath peak as the peak shifts around while the mobile 
changes its position relative to objects in its environment, thus causing 
reflective multipath. On successive symbols, half chip offset pilot symbol 

20 integration pairs Pu/n), Pql(h) and Pre(n), Pqe(h) are alternately output by late 
symbol accumulators 118 and 120. Once per symbol, the energy circuit 136 
calculates either [PiL(n) 2 + PqlO" 1 ) 2 ] or [PiE(n) 2 + PQ E (n) 2 ], corresponding to the 
energy in the pilot half a chip offset later or earlier than the peak being 
tracked. The time track filter 138 calculates the difference between these two 

25 energies, 



[Pl L (n)2 + P QL (n) 2 ] - [PiE(n-l) 2 + P QE (n-l) 2 ] (4) 

This difference forms a metric used to drive a second order low pass 
30 filter. The gain of both the first and second order contributions are specified by 
the microprocessor 30. This allows for a wider filter bandwidth during initial 
acquisition, later changing to a narrower bandwidth that can better reject 
spurious out-of-band noise once the finger is in lock. The time track filter 
issues an advance or retard when the final phase accumulation stage 
35 overflows or underflows. This feeds back to the finger timing generator 122 
that respectively compresses or expands a chip period by a single CHIPX8 
clock. This adjusts the finger offset an eighth of a chip in the direction to 
recenter it onto the peak of the path being tracked. 
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After the microprocessor 30 specifies a search window starting offset 
(written to search timing generator 200), and a search window length (written 
to search control block 206), the searcher 14 steps through the search window, 
evaluating each offset in the search window in sequence. For each offset, the 
5 searcher integrates the pilot over a specified number of chips (written to 
searcher timing block 200), calculates the resulting pilot energy, and optionally 
sums several pilot energies over a specified number of intervals (written to 
search control block 206). The output of the searcher is a trace of the multipath 
environment in the search window that looks much like FIG* 1. The 
10 multipath trace may be returned to the microprocessor directly or, to reduce 
the amount of data the microprocessor needs to handle, the searcher can filter 
the results so as to report only a sorted list of the largest peaks found in the 
search window. 

Just as the finger processing was divided into chip rate and symbol rate 
15 processing, in the preferred embodiment, the searcher is divided into two 
functional groups as depicted by the searcher functional block diagram of FIG. 
4. In a traditional dedicated circuit implementation, each of the elements in 
FIG. 4 have a correspondence to a physical circuit; in a traditional general 
purpose DSP implementation, each of these elements have corresponding 
20 steps in the signal processing code. All elements operating on the chip level 
are shown to the left of boundary 198, and all elements operating once per 
integration interval are shown to the right of boundary 198. 

xai.^ ^t-tti^itt-A x-x xcf piuvmcu WALti l emu \^ sailtpiC^ 3^. UlcU silt HlpUt tO S 

decimator 102. Unlike the finger 12a-c, which can select one of eight 
25 decimations of the input data, the searcher decimator 102 always samples at 
fixed half chip offsets during a search. The decimator 102 can be fixed because 
the searcher only evaluates the search window in half chip increments, a 
coarse sweep that is still detailed enough to ensure that no likely candidate 
paths are missed. Once a finger is assigned to a path found by a searcher, it will 
30 quickly center itself on the path even if the peak falls between two half chip 
separated search results. The sampling, as well as all other chip rate processing 
in the searcher, is slaved to the chip enable strobe 214 from searcher timing 
generator 200. 

Each advance or retard generated, either due to a search retard 
35 generated by search control block 218 as it sequentially steps the searcher 
through the search window, or from a slew by microprocessor 30 to begin a 
new search at a different starting offset, has the effect of slowing down or 
speeding up the rate at which the chip enable strobe 214 is generated. Searcher 
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timing generator 200 also outputs a sum_done strobe 216 indicating a search 
integration interval has completed. 

The searcher timing generator 200 stores the net effect of all offset 
changes in an internal searcher position register that can be read by the 
5 microprocessor 30. The searcher timing generator 200 also contains an 
internal position assignment register written to by the microprocessor to slew 
the searcher to a new offset. When the microprocessor 30 slews the searcher 
14, an internal mechanism in the searcher timing generator 200 continues to 
advance or retard the searcher timing until it determines that the searcher 14 
10 has reached its assigned offset. Once it has reached its assigned offset, the 
searcher 14 begins the specified search starting with the first offset in the 
search window. 

As with the finger 12a-c, in the searcher 14 the decimated on-time and 
late I and Q chip samples are provided to QPSK despreaders 104a and 104b 

15 respectively. The despreaders 104 also receive from the I Q PN sequence 
generator 106 the PN sequences identical to those used to spread the data at 
the base station. The I Q PN sequence generator 106 is slaved to the chip 
enable output 214 from the searcher timing generator 200, and is thus 
generating the PN sequences consistent with the current offset being 

20 evaluated by the searcher. The searcher is only interested in measuring the 
pilot strength at each offset and therefore has no need for the Walsh sequence 
generator found in the finger. 

The output of on-time despreader 104a is directly summed by on-time I 
and Q accumulators 162 and 164 and the output of late despreader 104b is 

25 directly summed by late I and Q accumulators 166 and 168, producing a pilot 
pair Pi(n) and PQ(n) for symbol, producing a data vector once per integration 
interval: 

|P,(n), P Q (n), P IL (n),P QL (n)} 

30 

that is then processed at the integration interval rate by elements drawn to the 
right of boundary 198 in FIG. 4. 

It should be noted that in the preferred embodiment, two offsets, an on 
time and late pair, are being evaluated simultaneously by the searcher. This 
35 parallelism is needed to ensure the searcher produces the multipath trace for 
typical search windows at a rate faster than the multipath environment 
changes. The signal processing discussed for the preferred embodiment can 
also apply without loss of generality to additional despreader accumulator 
pairs that can be used to obtain additional searcher performance if needed. 
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After each integration interval, the energy circuit 202 calculates [Pi(n) 2 + 
PQ(n) 2 ], corresponding to the on-time pilot energy, and the energy circuit 204 
calculates {PilO*) 2 + PqlM 2 ]/ corresponding to the late pilot energy for the 
offset currently being evaluated by the searcher. The on-time pilot energy is 
5 optionally summed over several integration intervals by non-coherent 
accumulator 208, and similarly late pilot energy is summed by non-coherent 
accumulator 210. 

After the specified number of integration intervals have elapsed, the 
results in the non-coherent accumulators 208, 210 are passed to a search result 

10 processor 212. Searcher control block 206 then decrements its internal offset 
count and issues a retard to the searcher timing generator 200. This causes the 
searcher to step to the next offset in the search window. 

Despreading begins a PN sequence consistent with the new offset being 
evaluated, on-time and late accumulators 162, 164, 166, 168 are cleared and 

15 begin summing despread pilot chips for the new offset. Once the search 
control block has sequenced the searcher through the specified number of 
chips in the search window, it returns the searcher to an idle state until the 
searcher is again commanded to search another window. 

In the above mentioned co-pending U.S. Patent Application Serial 

20 No. 08/144,902 entitled Demodulation Element Assignment in a System 
Capable of Receiving Multiple Signals, the preferred embodiment assigns the 
fingers based on the best results found in a search window. In the preferred 
embodiment, the four best results are tracked in search result processor 212. (A 
lesser or greater number of results could be stored in other embodiments.) A 

25 result register internal to the result processor 212 stores a sorted list of the 
largest peaks found and their corresponding offsets. If the latest search results 
provided by non-coherent accumulators 208 or 210 exceed those stored in the 
best result list, the control logic in the result processor 212 discards the fourth 
best result and inserts the new energy and corresponding offset at its 

30 appropriate position in the list. There are a great number of methods well 
known in the art for providing such a sorting function. Any one of them 
could be used within the scope of this invention. 

The search result processor 212 also has a local maxima filter function 
comparing current energy with energy obtained at the adjacent offset. The 

35 local maxima filter, if enabled, prevents the best result list from being updated 
even though a result would otherwise qualify for inclusion, unless the result 
represents a local multipath peak. In this way, the local maxima filter 
prevents strong, broad "smeared" multipath from filling multiple entries in 
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the best result list, leaving no room for weaker but distinct multipath that can 
make better candidates for demodulation. 

The implementation of the local maxima filter is straightforward. The 
current search result is compared to the preceding offset's result, the 
5 comparison result indicating the slope of the peak being traced. A slope 
transition from positive to negative indicates a local maxima and enables the 
best result list to be updated. The slope latch can be appropriately initialized 
and followed through so boundary edge offsets can be considered for inclusion 
as well. 

10 At the end of a search, the best result list is provided to the 

microprocessor. Having the searcher filter the results so that only the largest 
peaks need to be looked at by the microprocessor 30 significantly reduces the 
amount of processing the microprocessor 30 spends on its searcher task. 

FIG. 5 is a functional overview of the processing for the symbol 

15 combiner 22, the power combiner 24 and the frequency error combiner 26 in 
the mobile demodulator of FIG. 2. Once per symbol, the symbol combiner 
takes the deskewed symbol streams 42a-c from the three fingers, sums them 
via adder 262 and, after truncation and limiting (not shown), descrambles the 
combined soft decision symbol via XOR gate 270 of FIG. 6 using a user-specific 

20 long code 280 time aligned to a counterpart in the cell. The user long code 280 
is unique to each user and configured with parameters that have not been 
broadcast over the air during call setup, thus providing some measure of 
privacy. The user PN generator is embedded in and time aligned with a 
combiner timing generator 264. The combiner timing generator 264 outputs a 

25 combiner symbol strobe 282 independent of the finger symbol strobes 158a-c, 
which has been previously mentioned in its role enabling simultaneous reads 
from the symbol deskew buffers 144 in the fingers 12a-c. 

Combiner timing generator 264 has an input TX_PCG signal 278 
coming from the modulator section of the modem (not shown), indicating 

30 that the mobile unit transmitted on the reverse link during the prior power 
control group. In the preferred embodiment, the power control group is a 1.25 
msec interval over which the mobile unit may gate its transmission on the 
reverse link. If the mobile transmitted, TXJPCG 278 tells the combiner to 
listen for a power control decision on the closed loop power control decision 

35 sub-channel on the forward link. 

Bits sampled from the user PN sequence 280 determine which of the 
forward link traffic symbols within a power control group are punctured to 
provide the power control decision bits. In the preferred embodiment, 
depending on the application the power control decision may puncture one or 
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two symbols. During the punctured symbol, the combiner timing generator 
264 asserts the PUNCT signal 284. This masks the symbol data so that an 
erasure is placed in the symbol data stream 46 provided to the deinterleaver 
and decoder. Given the powerful forward error correction codes used in the 
5 preferred embodiment, the decoder 28 can reconstruct the punctured symbols. 

The power combiner 24 uses the same three deskewed finger symbol 
streams used by the symbol combiner 22. The power combiner 24 is actually 
three separate adder accumulator pairs allowing the mobile to monitor power 
decisions from three different cells. Normally only one of these adder- 
10 accumulator pairs is active, but in two way or three way soft hand-off, the 
mobile can take power decisions from two or three cells at the same time. 

Cell 0 uses adder 246 and accumulator 252; Cell 1 uses adder 248 and 
accumulator 254; Cell 2 uses adder 250 and accumulator 256. Once per symbol, 
adders 246, 248 and 250 sum the symbol streams 42a-c from the three fingers 
15 12a-c. Accumulators 252, 254 and 256 sum the resulting combined symbols 
over two successive symbols if two-symbol puncturing is used. During soft 
hand-off, the fingers 12a-c can be reassigned freely between cells as the 
multipath environment for each cell changes. 

To provide maximum flexibility, AND gates 240a-c / 242a-c, and 244a-c 
20 provide the microprocessor 30 a means to switch fingers from one cell to 
another. For example, when not in soft hand-off. only the cell 0 adder- 
accumulator pair 246 252 is used. All three AND gates 240a-c are enabled, 
while the AND gates 242a-c and 244a-c for cell 1 and 2 respectively are 
disabled, masking the finger contributions to adder-accumulator pairs 248 254 
25 and 250 256, effectively shutting them down. 

In a three way soft hand-off, one finger is assigned to each cell and one 
each of AND gates 240a-c 242a-c and 244a-c are enabled with the other two 
AND gates from each grouping disabled, so that all three adder-accumulator 
pairs are active. The sign bit in accumulators 252, 254, 256 form a hard "up=0" 
30 or "down=l" decision. 

In a soft hand-off, if any one cell is requesting the mobile to turn down 
its transmit power, this indicates that the mobile is coming through loud and 
clear at that ceil and the other ceils decisions should be ignored. This logic is 
reflected in the "or~of-the-downs" OR gate 258 which combines the power 
35 decisions from the active cells. The output of OR gate 258, representing the 
final combined decision, is summed in TXGAIN accumulator 268. 

TXGAIN accumulator is enabled by the PUNCT signal 284, ensuring 
that its transmit gain output is adjusted only in response to the power 
decision symbols. The TXGAIN value is converted to an analog voltage level 
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by externally R-C filtering the TXGAIN output of pulse density modulator 
(PDM) 276, which outputs a pulse train whose density over a set time interval 
is proportional to the input value provided by TXGAIN ACCUM 268. 

Once per symbol, the frequency error combiner 26 takes the frequency 
5 error streams 44a-c from the three fingers, sums them via adder 260, and after 
truncation and limiting (not shown), sums the combined frequency error into 
LO_ADJ accumulator 266 to provide a local oscillator adjustment reference. 
The LO__ADJ value is converted to an analog voltage level by externally R-C 
filtering the LO_ADJ output 36 of PDM 274. The PDM 274 outputs a pulse 

10 train whose density over a set time interval is proportional to the input value 
provided by LO_ADJ ACCUM 266. 

In a traditional dedicated circuit implementation, each multiplier, 
accumulator or comparator described in FIGs. 3, 4 and 5 is implemented 
separately as a discrete circuit, with each element having a direct 

15 correspondence to some circuit area on an integrated circuit (IC) die. Of 
particular concern are the four multiplier accumulators used to perform on- 
time pilot filter energy, early or late pilot filter energy, the cross product 
operation, and the dot product operation, replicated for each finger. 

These structures take a sizable amount of die area to implement, and 

20 the inventors recognized with a full symbol to complete the processing, the 
functionality could be more efficiently implemented using a shared datapath. 
The resulting hybrid architecture, containing elements of both the dedicated 
circuit and general DSP approaches, is shown in FIG. 6. All of the finger chip 
rate circuits shown to the left of line 98 in FIG. 3 and the searcher chip rate 

25 circuits shown to the left of line 198 in FIG. 4 are preserved in residual 
dedicated circuits depicted in FIG. 6 as finger front ends 312 and searcher front 
end 314 respectively. All of the finger symbol rate processing to the right of 
line 98 in FIG. 3, all of the searcher per integration interval processing to the 
right of line 198 in FIG. 4, and the combiner functions of FIG. 5 have been 

30 integrated into a shared multiply-accumulator datapath 300. 

Once per symbol, the finger front ends 312 produce a data vector 
consisting of the I and Q symbol integration for the traffic channel symbol 
data, the on-time pilot, and the early or late pilot. Once per integration 
interval, the searcher front end 314 produces a data vector consisting of the I 

35 and Q symbol integration results for the on-time and late pilot. Components 
of the data vectors are accessed by the shared datapath through a tri-state bus 
174 shared in common by the finger front ends and the searcher front end. 

Once per symbol, combiner timing generator 264 outputs a combiner 
symbol enable 282, finger front ends output their individual symbol enables 
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158a-c, and once per searcher integration interval, searcher outputs a 
sum_done signal 216. Datapath control circuit 308 uses these strobes to 
arbitrate the use of datapath 300 between the finger front ends 312, the 
searcher front end 314 and combining functions on a first-come, first-serve 
5 basis. Once queued, the controller 308 sequences the datapath 300 through a 
section of microcoded instructions stored in the microcode ROM 306. The 
microcode configures the internal elements of datapath 300 to allow it to 
perform all of the accumulations, multiplication, and comparisons needed for 
the signal processing of the block being serviced. The controller reads and 
10 writes from a random access memory (RAM) 304 which acts as a register file 
storing all demodulator state information preserved across symbol 
boundaries. These include such items as the deskew memory and various 
filter values for each finger 12a-c, and sorted list of largest peaks found for the 
searcher 14. 

15 FIG. 7 is a block diagram of a finger front end 312. It performs the 

identical function of the chip rate processing detailed earlier for the finger of 
FIG. 3 up to chip accumulators 110, 112, 114, 116, 118, 120. On the finger symbol 
enable strobe 158, the data vector output of these accumulators is latched by 
half latches 350a-f, buffering the data vector so that the finger chip 

20 accumulators may begin summing the data vector for the next symbol while 
the values latched in half latches 350a-f await their turn to be processed by 
shared datapath 300. Half latches 350a-f are tri-state buffered 352a-f to allow 
them to be output on a common bus shared between all finger front ends and 
searcher front end. The tri-state bus 174 is a distributed multiplexer; datapath 

25 control 308 selects one of the tri-state drivers 352a-f in a finger front end or the 
searcher front end to drive out onto the bus. The tri-state bus 174 provides the 
datapath access to ail of the various data vector components with minimal 
routing overhead. The finger front end timing generator 122 accepts an 
external advance or retard 160 generated by datapath control 308 as it updates 

30 the values of that finger's time tracking filter. 

FIG. 8 is a block diagram of the searcher front end 314. It performs the 
identical function of the chip rate processing detailed earlier for the searcher 
of FIG. 4 up to chip accumulators 162, 164, 166, 168. On the searcher 5um_done 
boundary 216, the data vector output of these accumulators is latched by half 

35 latches 360a-d. buffering the data vector so that the searcher chip accumulators 
may begin summing the data vector for the next symbol while the values 
latched in half latches 360a-d await their turn to be processed by shared 
datapath 300. Half latches 360a-d are tri-state buffered 362a-d to allow them to 
be output on a common bus 174 shared with the finger front ends. Datapath 
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control 308 selects one of the tri-state drivers 362a-d to drive out onto the bus 
while the datapath is servicing the searcher. The searcher front end timing 
generator 200 accepts an external searcher retard generated by datapath control 
308 as it finishes processing one offset and moves to consider the next offset in 
5 the search window. 

Returning to FIG. 6, datapath 300 includes two operand input latches 
322, 326. These operand latches can contain values from either the tri-state 
data vector bus 174 or from register file RAM 304, independently selected via 
multiplexer (MUX) 320 and MUX 324. For example, when squaring Pil(u) for 

10 the half chip late pilot energy calculation used in time tracking; in this case 
MUX's 320 and 324 both select the input from the data vector input bus. When 
performing the dot product, filtered pilot Pfi(n) is read from the register file 
RAM which is selected by MUX 324 and captured by latch 326, while Di(n) for 
the finger being serviced is driven onto the data vector input bus which is 

15 selected by MUX 320 and captured by latch 322. The two operand latches are 
multiplied by multiplier 328. 

Multiplier 328 is a full parallel combinational multiplier yielding the 
product of the two operands within a single clock cycle. Either the multiplier 
output or the operand stored in latch 326 is selected via MUX 330 to be 

20 summed by adder 334 with accumulator feedback latch 342. 

All arithmetic operations in the datapath are performed using 2's 
complement representation of values, so by performing a l's complement 
inversion using XOR gate 332 and fixing the carry in to the LSB of adder to 1, 
the output of MUX 330 may be conditionally subtracted instead of added. AND 

25 gate 336 can conditionally mask the feedback of accumulator latch 342 
summing into adder 334, so the output of MUX 330 may be loaded into the 
accumulator latch 342 instead of being summed with its prior contents. 

The output of the adder feeds a programmable limit stage 338 that, 
together with a programmable normalization stage 340, selects only the adder 

30 output bits of interest for the operation being performed. By renormalizing 
the results after each operation, truncating the LSBs that are well below the 
system noise floor, and saturating any MSB's that rarely if ever get set, all 
values operated on can be maintained in at most a double precision word 
without ever encountering bit overflow. 

35 The data vector provided on tri-state bus 174 is a single precision word, 

10 bits wide in the preferred embodiment. Typically the finger symbol and 
frequency error stored in register file RAM 304 are single precision words, 
while filter values are stored as double precision words, 20 bits wide in the 
preferred embodiment. The register file RAM 304 consists of two banks that 
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can be accessed independently to access a single precision word or together to 
access a double precision word. 

A storage map of register file 304 that, in the preferred embodiment, is 
comprised of two RAM banks of 64 10-bit words, is shown in FIG. 9. Storage in 
5 the register file is divided into finger pages, a searcher page and a combiner 
page. The organization of the fields within a finger page is the same for each 
finger, so that the index of the finger front end being serviced forms the page 
select, and the finger state values accessed from register file 304 is specified as 
an offset into the selected page. For each finger 12a-c, the symbol deskew buffer 

10 memory, the I and Q pilot filters values and their delayed version for the cross 
product, the time track filter values, and the lock energy filter value are all 
stored in register file. 

The register file contains microprocessor write registers, namely the in- 
lock and out-of-lock thresholds, the initial finger energy and frequency 

15 accumulator term used in the second order time tracking loop, taken after the 
finger completes a slew. The register file also contains microprocessor read 
registers, namely the finger energy and frequency accumulator term used in 
the second order time tracking loop. These values are stored more efficiently 
in RAM than in discretely instantiated read and write latches; a 

20 microprocessor read /write tap 344 provides a port through which the 
microprocessor can read or write these values, temporarily halting the 
sequencing of the datapath while an access is made. The microprocessor 
accesses these values infrequently enough so that the any resulting delay in 
datapath sequencing is insignificant. 

25 For the searcher 14, the register file stores the intermediate values of 

the on-time and late non-coherent accumulators, as well as the prior energy 
value used in local maxima detection, and the sorted list of the four strongest 
peaks found by the searcher and their corresponding offset. For the combiner, 
the register file stores the state of cell accumulators 252, 254, and 256 when 

30 summing two consecutive punctured symbols, as well as the states of the 
TXGAIN accumulator 268 and the LO.ADJ accumulator 266. The initial value 
of TXGAIN and LO_ADJ accumulators 268, 266 can be specified by 
microprocessor 30, and their current value read back by microprocessor 30 
using the read/write tap 344. 

35 Returning to FIG. 6, the limited, normalized adder output is captured 

by accumulator latch 342. Accumulator latch 342 output feeds back to adder 
336 for further summing, and its contents can be written back into register file 
RAM 304. The output of Latch 342 is captured by TXGAIN PDM 274 and 
LO_ADJ PDM 276 at the appropriate time as updated TXGAIN or LCLADJ 



WO 97/01227 



PCMJS96/10575 



accumulator values respectively are written back into register file RAM 304. 
While servicing the combiner functions, the datapath produces at one point 
the combined symbol at the output of accumulator latch 342. The combined 
symbol is then scrambled by XOR gate 270 and erased by AND gate 272 by user 
5 PN sequence 280 and PUNCT signal 284 outputs of combiner timing generator 
264 respectively, as described previously for these same circuits shown in FIG. 
5. 

Due to ease of implementation, in the preferred embodiment, certain 
states affecting the sequencing of the datapath, such as the finger lock states, 

10 the slope latch for the local maxima filter, the read and write pointers for the 
finger deskew buffer, the non-coherent accumulation and current search 
offset count are realized by discrete latches and accompanying control logic 
internal to datapath control circuit 308, instead of through datapath 
sequencing and additional storage allocations in register file 304. By tracking 

15 the read and write deskew buffer pointers, the largest time interval handled by 
the combiner timing generator 264 or the finger timing generator 122 is their 
respective symbol interval strobes 282 and 158. 

The datapath control 308 uses the sign bit output 346 from adder 334 as 
a flag to control the datapath sequencing for the in-lock threshold, the out-of- 

20 lock threshold, the local maxima filter, and during the sorting of the best 
searcher result list. If sign bit 346 overflows while computing the new time 
track filter output, this signifies the finger to advance or retard by a CHEPX8. 
When this happens, the advance or retard command 160a-c is fed back from 
datapath control 308 to the finger front end 312 being serviced by the datapath. 

25 Through microprocessor databus 34, the microprocessor 30 specifies to the 
datapath controller 308 the number of integrations to perform for each offset 
and the number of offsets in the search window. The microprocessor also 
specifies to the datapath control the per cell finger enables for power 
combining, the finger time tracking loop gains, and can also directly write the 

30 lock state of the finger, overriding the value determined by the lock threshold 
comparison. 

As mentioned earlier, the finger front ends, combiner functions, or 
searcher front end are serviced on a first-come-first-serve basis as their 
respective symbol enable strobes 158a-c, 282 or sum_done strobe 216 are 
35 asserted. Datapath control 308 places the requesting elements in a queue to be 
processed as soon as the datapath 300 has finished its servicing of earlier 
requesting elements. If two or more elements request servicing at exactly the 
same time, datapath control 308 arbitrarily assigns one of the contending 
elements to be first in the queue and the other lines up behind it. Since the 
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finger front end and searcher front end outputs are buffered, the datapath has 
a full symbol to service them until the next symbol result overwrites the data 
vector in the buffer. So long as the datapath has extra clock cycles available per 
symbol period, it can always ensure that each finger 12a-c can be serviced 
5 before the next symbol boundary occurs under any worst-case queuing 
scenario. 

During an advance, the finger time tracking loop shaves an 
inconsequential single clock off the interval between successive symbol 
strobes 158. Of more concern is the case when multiple fingers 12a-c are being 

10 slewed in an advancing direction. In this scenario, the fingers 12a-c are 
advancing on a chip per chip basis, so that the interval between successive 
symbol strobes 158 is halved. Rather than having to design the datapath 
sequencing so that it has enough headroom to handle a worst-case queue 
pattern using a 256 clock interval, the finger timing generators 122 simply 

15 suppress their symbol enable strobe output 158a-c during an advance, 
reenabling them once the fingers complete a slew and arrive at their assigned 
offset. 

Once queued, the controller sequences the datapath through a fixed 
processing sequence, configuring it to perform all of the accumulations, 

20 multiplication, and comparisons associated with the signal processing of the 
block being serviced. The type of element being serviced forms a page select 
into microcode ROM 306, and the clock count into the sequencing is used as 
an offset into the selected page to form the microcode ROM address. The 
microcode ROM output specifies the component driven onto data vector tri- 

25 state bus 174, any accesses to or from register file RAM 304, and a control word, 
labeled c[16:0], configuring the internal elements of shared datapath 300. 
Signals c[0], c[2], c[4] form mux select input for MUX's 324, 320, 330 respectively; 
signals cfl], c[3], c[16] are enables for latches 326, 322, 342 respectively; signals 
c[5j,q6],c[7j control the conditional subtract and load functions for adder 334, 

30 and fields c[ll:8] and c[15:12] fully specify the limit and normalization bit 
positions for the output of adder 334. 

The sequence of operations performed by the datapath on successive 
clock cycles while servicing a finger front end 312 is shown in FIG. 10. It 
closely follows the symbol rate processing for the finger discussed in relation 

35 to FIG. 3. For each cycle, the table in FIG. 10 lists the component, if any, driven 
onto tri-state data vector bus 174, the accesses, if any, to or from the register file 
RAM 304, the c[16:0] datapath control word, and a brief comment useful in 
referring back to description of the symbol rate signal processing for the finger 
in FIG. 3. 
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First, the pilot filters are updated by subtracting off a fraction of their 
current level and summing in the on-time I and Q pilot accumulator outputs 
from the selected finger front end during clock cycles 0-6. During clock cycles 
7-9, the dot product is calculated using the filtered pilots and the symbol 
5 accumulation outputs of the selected finger front end. During clock cycles 10- 
13, the cross product is calculated using the filtered pilots and the previous 
symbol's filtered pilot values stored in register file 304. During clocks 14-16 the 
pilot energy is calculated. This energy is written to a temporary scratch 
location in register file 304 while the lock detect filter is first updated by 

10 subtracting off a fraction of its current level during clock cycles 17-18. 

The pilot energy for the current symbol is then read back out and 
summed to yield a new lock detect filter value which is written back to 
register file 304 during clock cycles 19-21. A new lock state is also determined 
by comparison with the in-lock and out-of-lock thresholds during clock cycles 

15 20-21. During clocks 22-24 the late pilot energy is calculated, subtracted with 
the early pilot energy obtained on the previous symbol and read back from 
register file RAM 304 to generate a late - early energy delta metric to drive the 
second order time tracking loop. 

The time track metric is written into the register file 304 and read 

20 immediately back again to position it as an input to the datapath. It is scaled by 
microprocessor specified gain constant Kl as it is loaded into the accumulator 
output latch 342 during clock cycle 27. This scaled value is then added to the 
time tracking frequency accumulator component of the second order filter. 
The updated time tracking frequency accumulator is written back into the 

25 register file 304 and read immediately back again to position it as an input to 
the datapath, where it is summed with the time tracking metric scaled by 
microprocessor specified gain constant K2 during clock cycle 32. This value is 
summed with the time tracking phase accumulator component of the second 
order filter, and the new phase accumulator value is written back to register 

30 file 304 on clock cycle 34. Thus the datapath requires a total of 35 clock cycles to 
process a finger for each symbol. 

The sequence of operations performed by the datapath on successive 
clock cycles while servicing the searcher front end 314 is shown in FIG. 11. It 
closely follows the per integration interval processing for the searcher 

35 discussed in relation to FIG. 4. During clock cycles 0-2, the pilot energy for the 
late pilot integration is calculated. This energy is summed with an 
intermediate non-coherent accumulator value on clock cycle 3, and the new 
sum over the number of elapsed integration intervals is written back into 
register file 304 on clock cycle 4. These same operations for the on-time pilot 
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integration occur during clock cycles 4-8. If the searcher 14 still has further 
integrations to perform at the same offset, the datapath is then finished 
servicing the searcher, as indicated by the solid line drawn after clock cycle 8 in 
FIG. 11. 

5 If, instead, this is the final integration interval for the current offset, 

processing continues. The local maxima filter processing occurs during clock 
cycles 9-12. The datapath 300 determines the slope of the multipath trace 
between the on-time and late offset results, and between the late offset result 
and the prior offset's on-time result which has been stored in register file 304. 

10 If the slope latch transitions from "1" to "0", a local maxima has been detected. 
The datapath 300 can then consider the peak for inclusion in the sorted list of 
the largest peaks found in the search up to that point. 

Starting with peak 0, the strongest peak, on clock cycle 13, and 
continuing through to peak 3 on clock cycle 23, the energy for the current 

15 offset being processed is compared with the stored peaks. If the input energy is 
greater than the stored energy being compared against, the input energy 
overwrites the stored energy, that then simultaneously replaces the input 
energy in accumulator latch 342. By stepping from larger to smaller peaks, 
once the input energy exceeds a stored peak, all of the lesser peaks are 

20 automatically "demoted" a ranking as a matter of course as the peak 
comparison progresses. This is but one of a number of methods well known 
in the art for providing such a sorting function. Any one of them could be 
used within the scope of this invention. The minimum searcher integration 
interval in the preferred embodiment is 32 chips, and with a single 

25 integration interval per offset, worst case, the datapath 300 requires a total of 
24 clock cycles for every 32 chip integration interval to support the searcher. 

The sequence of operations performed by the datapath on successive 
clock cycles, while servicing the combiner functions, is shown in FIG. 12. It 
closely follows the symbol rate processing for the combiner functions 

30 discussed in relation to FIG. 5. One deskewed finger symbol per clock cycle is 
read from register file 304, resulting in a final combined, limited and 
truncated soft decision symbol on clock cycle 3. During clock cycles 4-8, 9-13, 
14-17 similar per finger summations take place on punctured symbols for cell 
0, cell 1, and cell 2 power control decisions respectively. If two symbol 

35 puncturing is used, the combined punctured symbol may be summed with 
the prior combined symbol for the cell being processed, stored in register file 
304. The "or-of-the-downs" OR gate 258 is a discrete gate in datapath control 
308 using the adder sign bit output 346 as a hard up/down decision for each 
cell as they are sequenced through. During clock cycles 19-20, a +1 or -1 based 
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on the combined power decision is added to the TXGAIN value read from 
register file 304. The new TXGAIN value is captured by PDM 276 as it is 
written back into register file 304. During clock cycles 22-24, one finger 
frequency error per clock cycle is read from register file 304 and summed to 
5 produce a new frequency error adjustment, which is added to the LO_ADJ 
value read from register file 304. The new LO_ADJ value is captured by PDM 
274 as it is written back into register file 304. Thus the datapath requires a total 
of 28 clock cycles to process a finger for each symbol. 

The architecture of the present invention has several advantages. For 

10 example, because the complex processing block are shared across a set of finger 
front end blocks, the ability to demodulate an additional signal path can be 
added by simply adding a new finger front end block. The finger front end 
block does not require a significant die area and therefore the cost of extending 
demodulation capability in this fashion is fairly low. With 512 CHIPX8 clocks 

15 per symbol, the datapath has plenty of "headroom", or idle cycles in excess of 
what it needs to perform its assigned signal processing tasks. 

As calculated in the presentation of FIGs. 10, 11, and 12, during a 512 
CHEPX8 symbol interval, the preferred embodiment using three finger front 
ends and a minimum searcher integration interval of 32 chips the datapath 

20 will be processing the fingers for 105 clock cycles, the searcher for 48 clock 
cycles and the combiner for 28 clock cycles, using a total of 181 of the 512 
available clock cycles, corresponding to a 35% utilization factor. Another way 
of stating this is the datapath runs at 3.5 MIPS. This demonstrates the 
significance of the off-loading the simple chip rate functions to the dedicated 

25 searcher and finger front ends; which drops the signal processing requirement 
from 75 MIPS to 3.5 MIPS. This translates directly to power savings, and the 
power consumed by the dedicated front ends adds back only a fraction of this 
amount. For significant extensions in the amount or type of finger and 
searcher processing, or to support higher data rate services with their 

30 correspondingly shorter symbol periods, this headroom can be increased by 
simply increasing the frequency at which the shared datapath is clocked. 

The demodulator architecture embodied in this invention is a hybrid of 
traditional dedicated circuit and general purpose DSP approaches. In 
comparison with the traditional dedicated circuit approach, the shared 

35 datapath consumes less power and is considerably smaller than the discrete 
symbol rate circuits presented in FIGs. 3, 4, and 5. The shared datapath is 
compact and tailored to the modest needs of the processing tasks at hand, 
using 10 bit single precision and 20 bit double precision arithmetic. The hybrid 
approach maintains the flexibility of coding algorithms in firmware instead of 
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dedicated circuitry. The timeline sequencing of FIGs. 10, 11, and 12 form a basis 
for a small microcode kernel; taken together the finger, searcher and combiner 
functions are implementable in under 100 lines of microcode. 

The previous description of the preferred embodiment is provided to 
5 enable any person skilled in the art to make or use the present invention. The 
various modifications to these embodiments will be readily apparent to those 
skilled in the art, and the generic principles defined herein may be applied to 
other embodiments without the use of the inventive faculty. Thus, the 
present invention is not intended to be limited to the embodiments shown 
10 herein but is to be accorded the widest scope consistent with the principles and 
novel features disclosed herein. 
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CLAIMS 

1. A spread spectrum demodulation apparatus for use in a multiple 
2 access communication system, the apparatus comprising: 

a plurality of finger front ends,- each front end receiving spread signals 
4 and performing chip rate signal processing associated with the spread 

spectrum demodulation apparatus; 
6 a buffer, coupled to the plurality of finger front ends, for buffering per 

symbol accumulated data vectors; 
8 a storage device for maintaining state information associated with 

symbol rate signal processing of the spread spectrum demodulation 
10 apparatus; 

an arithmetic datapath, coupled to the storage device and the buffer, 
12 for performing symbol rate multiply and accumulate functions associated 
with signal processing of the spread spectrum demodulation apparatus, the 
14 arithmetic datapath having a symbol output; and 

a datapath control circuit, coupled to the arithmetic datapath, for 
16 arbitrating the use of the arithmetic datapath between the plurality of finger 
front ends. 

2. The spread spectrum demodulation apparatus of claim 1 wherein 
2 the arithmetic datapath comprises: 

a first multiplexer having a first input coupled to the plurality of 
4 finger front ends and a second input coupled to the storage device, the first 
multiplexer selecting a first signal from a finger front end of the plurality of 
6 finger front ends or the storage device, the first multiplexer providing the 

first selected signal at an output; 
8 a second multiplexer having a first input coupled to the plurality of 

finger front ends and a second input coupled to the storage device, the 
10 second multiplexer selecting a second signal from a finger front end of the 
plurality of finger front ends or the storage device, the second multiplexer 
12 providing the second selected signal at an output; 

a multiplier having a first input coupled to the first multiplexer and a 
14 second input coupled to the second multiplexer, the multiplier providing a 

product signal at an output; 
16 a third multiplexer having a first input coupled to the multiplier 

output and a second input coupled to the second multiplexer output, the 
18 third multiplexer providing either the second selected signal or the product 
signal at an output; 
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20 an adder/subtractor having a first input coupled to the third 

multiplexer output and a second input coupled to an arithmetic datapath 

22 output signal, the adder/subtractor providing a sum signal at an output; 

a limiter/normalizer circuit, coupled to the adder/subtractor output, 

24 for selectively limiting the sum signal to a predetermined range, the 
limiter/ normalizer circuit providing a normalized sum signal; and 

26 a latch, coupled to the limiter/normalizer circuit, for storing the 

normalized sum signal thus providing the arithmetic datapath output 

28 signal. 

3. The spread spectrum demodulation apparatus of claim 1 and 
2 further including: 

a searcher front end, coupled between the plurality of received spread 
4 signals and the arithmetic datapath, for calculating signal energy of the 

plurality of received spread signals; and 
6 a symbol combiner, coupled to the arithmetic datapath, for combining 

the symbol output into a demodulated symbol stream. 

4. The spread spectrum demodulation apparatus of claim 3 wherein 
2 the datapath control circuit further arbitrates the use of the arithmetic data 

path between the finger front ends, the searcher front end, and the symbol 
4 combiner 



5. The spread spectrum demodulation apparatus of claim 3 wherein 
2 the searcher front end comprises: 

a pseudonoise sequence generator for generating an I sequence and a 
4 Q sequence; 

a declinator, coupled to the received spread signals, for generating an I 
6 on-time signal, a Q on-time signal, an I late signal, and a Q late signal by 

selectively sampling the received spread signals; 
8 a first despreader coupled to the I and Q sequences from the 

pseudonoise sequence generator and the I and Q on-time signals, the first 
10 despreader generating a first despread I signal and a first despread Q signal; 

a second despreader coupled to the I and O sequences from the 
12 pseudonoise signal generator and the I and Q late signals, the second 
despreader generating a second despread I signal and a second despread Q 
14 signal; 

a plurality of accumulators, a first accumulator coupled to the first 
16 despread I signal, a second accumulator coupled to the first despread Q 
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signal, a third accumulator coupled to the second despread I signal, and a 
18 fourth accumulator coupled to the second despread Q signal, the plurality of 

accumulators summing their respective I or Q signals; 
20 a plurality of latches, each latch coupled to an accumulator of the 

plurality of accumulators; and 
22 a timing generator for controlling the first and second despreaders, 

the pseudonoise sequence generator, and the plurality of accumulators. 

6. The spread spectrum demodulation apparatus of claim 1 wherein 
2 each finger front end of the plurality of finger front ends comprises: 

a pseudonoise sequence generator for generating an I sequence and a 
4 Q sequence; 

a decimator, coupled to the received spread signals, for generating an I 
6 on-time signal, a Q on-time signal, an I late signal, and a Q late signal by 

selectively sampling the received spread signals; 
8 a first despreader coupled to the I and Q sequences from the 

pseudonoise sequence generator and the I and Q on-time signals, the first 
10 despreader generating a first despread I signal and a first despread Q signal; 

a second despreader coupled to the I and Q sequences from the 
12 pseudonoise signal generator and the I and Q late signals, the second 
despreader generating a second despread I signal and a second despread Q 
14 signal; 

a Walsh sequence generator for generating a Walsh chip sequence; 
16 an uncover circuit, coupled to the Walsh chip sequence generator, for 

reversing orthogonal covering of the first despread I and Q signals in 
18 response to the Walsh chip sequence; 

a plurality of accumulators, a first accumulator coupled to the first 
20 despread I signal, a second accumulator coupled to the first despread Q 
signal, a third accumulator coupled to the second despread I signal, a fourth 
22 accumulator coupled to the second despread Q signal, and a fifth and sixth 
accumulator coupled to the uncover circuit, the plurality of accumulators 
24 summing their respective I or Q signals; 

a plurality of latches, each latch coupled to an accumulator of the 
26 plurality of accumulators; and 

a timing generator for controlling the first and second despreaders, 
28 the pseudonoise sequence generator, and the plurality of accumulators. 



7. A method for spread spectrum demodulation in a multiple access 
2 communication system, the method comprising the steps of: 
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receiving spread signals by a plurality of finger front ends; 
4 performing chip rate signal processing, associated with a spread 

spectrum demodulator, on the received spread signals; 
6 buffering per symbol accumulated data vectors of the received spread 

signals; 

8 saving state information, in a storage device, associated with symbol 

rate signal processing of the spread spectrum demodulator; 

10 performing symbol rate multiplying and accumulating functions 

associated with signal processing of the finger front ends; and 

12 arbitrating and sequencing the symbol rate multiplying and 

accumulating between the plurality of finger front ends. 

8. The method of claim 7 wherein the step of arbitrating and 
2 sequencing further includes: 

arbitrating between the plurality of finger front ends, a combiner, and 
4 a searcher front end; 

performing searcher integration interval multiplying; and 
6 accumulating functions associated with the signal processing of the searcher 
front end; and 

8 performing the symbol rate accumulating functions associated with 

signal processing of the combiner. 

9. The method of claim 7 wherein the step of performing symbol rate 
2 multiplying and accumulating includes the steps of: 

multiplying a first signal from a finger front end of the plurality of 
4 finger front ends or the storage device with a second signal from a finger 
front end of the plurality of finger front ends or the storage device to 
6 produce a product signal; 

adding the product signal or the second signal to a feedback signal to 
8 produce a sum signal; and 

limiting the sum signal to a predetermined range to produce a limited 
10 sum signal; 

normalizing the limited sum signal to produce a normalized signal; 

12 and 

latching the normalized signal to produce the feedback signal 

10. The method of claim 9 and further including the step of latching 
2 the first signal and the second signal. 
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11. A radio for communicating in a multiple access communication 
2 system, the radio comprising: 

a controller for controlling the radiotelephone; 
4 a receiver for receiving a radio signal; and 

a demodulator, coupled to the controller and the receiver, for 
6 demodulating the radio signal, the demodulator comprising: 

a plurality of finger front ends, each front end receiving spread 
8 signals and performing chip rate signal processing associated with the 

spread spectrum demodulator; 
10 a buffer, coupled to the plurality of finger front ends, for 

buffering per symbol accumulated data vectors; 
12 a storage device for maintaining state information associated 

with symbol rate signal processing of the spread spectrum 
14 demodulator; 

an arithmetic datapath, coupled to the storage device and the 
16 buffer, for performing symbol rate multiply and accumulate functions 

associated with signal processing of the spread spectrum demodulator, 
18 the arithmetic datapath having a symbol output; 

a datapath control circuit, coupled to the arithmetic datapath, 
20 for arbitrating the use of the arithmetic datapath between the plurality 

of finger front ends; 

22 a searcher front end, coupled between the plurality of received 

spread signals and the arithmetic datapath, for calculating signal 
24 energy of the received spread signals; and 

a symbol combiner, coupled to the arithmetic datapath, for 
26 combining the symbol output into a demodulated symbol stream. 



12. A method for spread spectrum demodulation in a multiple access 
2 communication system, the method comprising the steps of: 

receiving spread signals by a plurality of finger front ends; 
4 performing chip rate signal processing, associated with a spread 

spectrum demodulator, on the received spread signals; 
6 buffering per symbol accumulated data vectors of the received spread 

signals; 

8 saving state information associated with symbol rate signal processing 

of the spread spectrum demodulator; 
10 performing symbol rate multiplying and accumulating functions 

associated with signal processing of the spread spectrum demodulator to 
12 provide a symbol output; 
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arbitrating and sequencing the symbol rate multiplying and 
14 accumulating between the plurality of finger front ends; and 

combining the symbol output to generate a demodulated signal 

2 13. A method for spread spectrum demodulation in a multiple access 

communication system, the method comprising the steps of: 
4 receiving spread signals by a plurality of finger front ends; and 

performing chip rate signal processing, associated with a spread 
6 spectrum demodulator, on the received spread signals. 
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